AD-AOn  176  NAVAL  POSTGRADUATE  SCHOOL  MONTEREY  CAL IP  F/6  9/2 

AN  APPLICATION  OF  CP/CMS  TO  THE  TIME  SERIES  ANALYSIS. (U) 

MAR  77  Y W LEE 


ft. 


SECURITY  CLASSIFICATION  Of  THIS  PAGE  (Whan  Data  Bnlatad) 


REPORT  DOCUMENTATION  PAGE 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 


I REPORT  NUMBER 


2.  GOVT  ACCESSION  NO 


3.  RECIPIENT'S  CATALOG  NUMBER 


TITLE  (ard  Subtttla) 


An  Application  of  CP/CMS  to  the  Time  Series 
• Analysis 


*.  TYPE  OP  REPORT  * PERIOD  COVERED 

Master's  Thesisfr  L- — " 
March  1977 


*•  PERFORMING  ORG.  REPORT  NUMBER 


|-*-  AU  thok/n  . . 

Young  Woo  Lee 


t.  CONTRACT  ON  GRANT  NUMBER!*; 


f.  PERFORMING  ORGANIZATION  NAME  AND  ADORE?* 

Naval  Postgraduate  School 
Monterey,  California  93940 


10.  PROGRAM  ELEMENT.  PROJECT,  TASK 
AREA  6 WORK  UNIT  NUMBERS 


II.  CONTROLLING  OFFICE  NAME  AND  AOORESS 

Naval  Postgraduate  School 
Monterey,  California  93940 


t/2.  REPORT  OATE 

Mar A 1977 


is.  number  of  pAges 

in  . ; 


T7  MONITORING  AGENCY  name  4 ADDRESSOI  dUtaraot  from  Control  tins  Ollica) 

Naval  Postgraduate  School 
Monterey,  California  93940 


IS.  SECURITY  CLASS,  (ol  thta  raport) 

Unclassified 


ISa.  DECLASSIFICATION.  DOWNGRADING 
SCHEDULE 


16.  DISTRIBUTION  STATEMENT  (ol  thla  Raport) 

Approved  for  public  release;  distribution  unlimited. 


17.  DISTRIBUTION  STATEMENT  (o I the  ebetrect  entered  In  Block  20,  II  dlllerent  from  Report) 


1§.  supplementary  notes 


19-  KEY  WOROS  ( Continue  on  terete e side  It  neceeemrr  ond  Identity  by  block  number) 

Time  Series  Analysis 
Box-Jenkins 
Time  Sharing 

Interactive  Computer  Analysis 


20.  ABSTRACT  ( Continue  on  rereree  tide  II  neceeeery  end  Identify  by  block  mmtber) 

An  interactive  package  of  computer  programs  has  been  developed  for 


the  analysis  of  time  series  data.  The  package,  called  the  Time  Series 
Editor,  is  designed  around  the  Box-Jenkins'  statistical  methodology  of 
time  aeries  analysis.  The  Time  Series  Editor  was  developed  for  time-shared 


J 


DD  1473 


(Page  1) 


EDITION  OF  I NOV  ••  IS  OBSOLETE 
S/N  0 103-0  1«-  660  1 | 


SECURITY  CLASSIFICATION  OF  THIS  PAOE  (PPaa  Data  +<taradj 


J 


tuCuWITV  CL  ASAI  AIC  ITlQM  or  THIS  natiw»n  n.i.  (» i.r.i 


use  on  the  Controlled  Program/Cambridge  Monitor  System  (CP/CMS) 
but  could  be  easily  modified  to  accommodate  other  time-sharing 
systems.  The  Time  Series  Editor  assists  in  data  preparation, 
entry,  analysis  and  diagnostic  testing.  Utilization  of  the  package 
requires  only  a limited  knowledge  of  the  computer  system  with  all 
required  user  responses  prompted  by  the  Editor. 


DD  Form  14.3 
1 Jan  73 

S/N  0102-014-6601 


J 


*ICU"ITV  CLAUlflCATION  OA  Tm*  D«li 

2 


Approved  for  public  release;  distribution  unlimited. 


An  Application  of  CP/CMS 
To  The  Time  Series  Analysis 

by 


Young  Woo  Lee 

Lieutenant  Colonel,  Republic  of  Korean  Air  Force 
B.S.,  Korean  Air  Force  Academy,  1963 


Submitted  in  partial  fulfillment  of  the 
requirements  for  the  degree  of 


MASTER  OF  SCIENCE  IN  OPERATIONS  RESEARCH 


Author 


Approved  by : 


STU 

Witt  5*r»m  \ 

3 5 

Ur,  Set' »•  n 

»(/'  •'•■•r.tn 

□ 

Jl'l  ,ii.  . 

C;'P'J 

V' 

v 


from  the 


D 


NAVAL  POSTGRADUATE  SCHOOL 
March  1977 


\r 


y 


pA 


Liu 


)l::. 


$1 


J 


1 


ABSTRACT 

An  interactive  package  of  computer  programs  has  been  developed  for 
the  analysis  of  time  series  data.  The  package,  called  the  Time  Series 
Editor,  is  designed  around  the  Box-Jenkins'  statistical  methodology  of 
time  series  analysis.  The  Time  Series  Editor  was  developed  for  time- 
shared  use  on  the  Controlled  Program/Cambridge  Monitor  System  (CP/CMS) 
but  could  be  easily  modified  to  accommodate  other  time-sharing  systems. 
The  Time  Series  Editor  assists  in  data  preparation,  entry,  analysis  and 
diagnostic  testing.  Utilization  of  the  package  requires  only  a limited 
knowledge  of  the  computer  system  with  all  required  user  responses 
prompted  by  the  Editor. 
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INTRODUCTION 


Operations  researchers,  statisticians,  economists,  market- 
ing personnel,  managers  and  many  others  are  frequently  faced 
with  the  need  to  analyze  time  series  data.  In  most  cases 
their  objective  is  to  discover  patterns  or  recognizable  be- 
havior in  historical  data  that  can  be  used  to  construct  mathe- 
matical models  of  the  time  series  from  which  forecasts  of 
future  behavior  can  be  obtained.  The  importance  of  being  able 
to  forecast  future  behavior  accurately  cannot  be  overempha- 
sized. Whether  the  data  be  budget  expenditures,  populations, 
natural  resource  consumption,  prices,  demands,  economic  indi- 
cators, stock-market  prices,  manpower  levels  or  whatever, 
decision  makers  concerned  with  planning  for  the  future  must 
base  their  decisions  on  their  best  predictions  about  the 
future  behavior  of  the  time  series. 

Until  the  late  1960's,  the  analysis  techniques  were  pri- 
marily those  of  spectral  analysis  with  heavy  application  of 
harmonic  analysis  and  mathematical  transform  theory.  Because 
of  the  mathematical  sophistication  required  by  the  spectral 
approach,  the  analysis  capability  resided  fairly  exclusively 
in  the  hands  of  mathematicians  and  engineers.  Consequently, 
many  naive  forecasting  methods  such  as  moving  averages,  ex- 
ponential smoothing  and  decomposition  analyses  were  adopted 
by  the  majority  of  the  decision  makers.  Since  the  late  1960's 
the  statistical  analysis  of  time  series,  embodied  primarily 
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in  the  methodology  developed  by  Box  and  Jenkins  [Ref.  2], 
has  received  widespread  acceptance.  Because  the  Box-Jenkins 
approach  is  described  in  a vocabulary  more  familiar  to  opera- 
tions researchers,  statisticians,  economists  and  managers, 
more  and  more  business  and  government  decision  makers  are 
building  models  from  past  data  to  use  for  planning  into  the 
future . 

Many  algorithms  and  computer  programs  for  performing  the 
analyses  required  by  the  Box-Jenkins  approach  have  been  de- 
veloped and  are  readily  available  from  many  sources.  One  of 
the  best  sources  is  the  collection  of  FORTRAN  computer  sub- 
routines which  resides  in  the  International  Mathematical  and 
Statistical  Library  (IMSL)  [Ref.  4],  The  major  problem  with 
using  the  available  computer  resources  lies  not  with  any  de- 
ficiency of  the  algorithms  or  the  programs , but  with  the  very 
nature  of  the  Box-Jenkins  approach.  The  Box-Jenkins  method 
is  an  iterative  approach  which  is  described  in  Figure  1. 

[See  Wheelwright  and  Makridakis,  ref.  7.] 


Stage  1 


Stage  2 


Stage  3 


Box-Jenkins 


forecasting  method. 
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Figure  1 


Figure  1 shows  that  the  Box-Jenkins  method  is  a multi- 
stage, iterative  process.  It  begins  with  the  postulation  of 
general  class  of  models  which  has  been  found,  experimentally 
to  be  extremely  rich.  Thereafter,  the  modeling  procedure 
continues  as  a trial-and-error  process  with  several  decision 
points  where  the  analyst  is  required  to  select  the  next 
direction  based  on  the  best  information  available  to  him. 

Each  stage  of  the  process  outlined  in  Figure  1 may  consist 
of  several  steps,  and,  even  with  the  existing  computer  soft- 
ware resources,  the  modeling  process  is  usually  very  time 
consuming.  For  example,  a typical  Box-Jenkins  time  series 
analyst  using  a batch  processing  computer  system  with  access 
to  the  IMSL  library  of  subroutines  might  perform  the  follow- 
ing sequence  of  tasks: 

1.  Prepare  time  series  data. 

2.  Plot  and  visually  examine  the  time  series  looking  for 
nonstationarity , trends,  deterministic  patterns,  etc. 

3.  Write  a program  to  call  the  IMSL  subroutine  that 
calculates  the  mean,  the  variance,  the  autocorrela- 
tions and  the  partial  autocorrelations. 

4.  Plot  the  autocorrelations  and  partial  autocorrelations. 
This  provides  the  major  information  needed  for 
identification  of  the  time  series. 

5.  Write  a program  to  call  the  IMSL  subroutine  which 
transforms  the  time  series  to  adjust  for  seasonal 
patterns,  nonstationary  behavior  or  other  behavior 
which  deviates  from  that  assumed  by  the  class  of 
models  postulated. 

6.  Repeat  steps  2)  through  4)  using  the  transformed 
data  . 

7.  Review  the  statistical  properties  of  the  autocorre- 
lations and  partial  autocorrelations  for  tentative 
identification  of  the  model. 
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8.  Write  a program  to  call  the  IMSL  program  that  esti- 
mates the  model  parameters  and  computes  the 
residuals . 

9.  Write  a program  to  perform  goodness  of  fit  tests. 

10.  Analyze  the  residuals  following  steps  1)  through  9) 
just  as  was  done  with  the  original  time  series. 

11.  Refine  the  model  using  the  information  obtained  by 
the  analysis  of  the  residuals. 

12.  Repeat  steps  8)  through  10). 

13.  When  an  adequate  model  is  obtained,  write  a program 
to  call  the  IMSL  subroutine  which  forecasts  and 
determines  confidence  intervals  for  future  values 
of  the  time  series. 

Between  each  pair  of  steps  the  user  must  manually  intervene 
and  make  a subjective  decision  based  on  the  information 
available.  Thus,  a great  deal  of  user  interaction  is  required 
in  order  to  determine  a mathematical  model  and  a forecast 
equation.  Even  with  rapid  computer  turnaround  time,  the 
process  can  easily  consume  a day  or  more  of  calendar  time. 

This  report  describes  an  effort  to  alleviate  some  of  the 
problems  involved  with  modeling  time  series  using  the  Box- 
Jenkins  approach.  An  interactive  computer  package  which  pro- 
vides easy  user  access  to  the  computational  computer  sub- 
routines available  in  IMSL  and  similar  subroutine  libraries 
was  developed.  The  package,  called  the  Time  Series  Editor, 
was  written  for  time-shared  use  on  the  Naval  Postgraduate 
School's  Controlled  Program/Cambridge  Monitor  System  (CP/ 

CMS) . Since  all  programs  except  the  executive  routine  are 
written  in  FORTRAN,  the  Time  Series  Editor  could  be  easily 
modified  to  accommodate  other  time-sharing  systems.  The 
Time  Series  Editor  assists  the  user  in  data  preparation. 
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entry,  model  construction,  diagnostic  testing  and  forecast- 
ing. Utilization  of  the  package  requires  only  a limited 
knowledge  of  CP/CMS.  In  fact,  with  the  User's  Guide  pro- 
vided in  this  report,  a complete  Box-Jenkins  time  series 
analysis  can  be  performed  in  a short  time  by  even  a naive 
computer  user.  For  this  reason,  the  Time  Series  Editor 
should  be  valuable  as  an  instructional  aid  for  laboratory 
use  in  a time  series  class. 

A brief  description  of  the  Box-Jenkins  methodology  is 
given  in  Chapter  2 to  serve  as  a point  of  reference  for  the 
remaining  material.  Chapter  3 contains  descriptions  of 
each  of  the  programs  contained  in  the  Time  Series  Editor. 
The  use  of  the  Time  Series  Editor  is  illustrated  with  an 
example  time  series  which  is  given  in  Chapter  4. 

Chapter  5 contains  a summary  and  recommendations  for 
additions  to  the  Time  Series  Editor.  A User's  Guide  which 
includes  an  explanation  of  CP/CMS  sufficient  for  utiliza- 
tion of  the  Time  Series  Editor  is  given  in  Appendix  A. 
Sample  user  sessions,  sample  outputs  and  complete  computer 
listings  are  also  included  in  appendices. 
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II.  BOX-JENKINS  METHODOLOGY 


In  this  chapter  a brief  description  of  the  Box-Jenkins 
time  series  modeling  methodology  is  given.  For  a more  de- 
tailed discussion  the  reader  is  referred  to  the  texts  by 
Anderson  [Ref.  1],  Box  and  Jenkins  [Ref.  2],  Pindyck  and 
Rubinfeld  [Ref.  6],  and  Nelson  [Ref.  5],  The  material  pre- 
sented here  is  included  primarily  to  serve  as  a point  of 
reference  for  the  program  descriptions  that  follow  in  later 
chapters.  It  is  also  included  here  to  aid  the  user  in 
understanding  the  computer  output  and  the  questions  asked 
by  the  Time  Series  Editor. 

A.  PROPERTIES  OF  STATIONARY  PROCESSES 

A discrete  stochastic  time  series  is  a set  of  observa- 
tions y ^ , y ,...,  y^  generated  sequentially  in  time  by  a 
set  of  jointly  distributed  random  variables;  i.e.,  the  data 
y ,...,  yT  represents  a particular  realization  of  a joint 
probability  distribution  f ( y , y 2 , • • • , y T) . A future  obser- 
vation, yT+k  can  be  thought  of  as  being  generated  by  a condi- 
tional probability  distribution  function  f ( YT+k | y ^ , • • • , yT ) 
given  the  realization  through  time  T.  The  stochastic  pro- 
cess which  generates  the  time  series  is  said  to  be  station- 
ary if  its  properties  are  unaffected  by  a change  of  time 
origin;  that  is,  if  the  joint  probability  distribution  asso- 
ciated with  m observations  y , y ,...,  y , made  at  any 

1 t2  m 
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t is  the  same  as  that  distribution 
m 


1 


I 
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( 


set  of  times  t^,  t 2 

associated  with  m observations  y 

at  times  t,+k,...,  t +k . 

1 m 

If  a process  is  stationary, 
f ( y ^ ) is  the  same  for  all  times 
constant  mean 


t, +k ' y t_+k 


/ • • • t 


t+k 

m 


made 


the  probability  distribution 
t.  Thus  the  process  has  a 


y 


00 

E[yt]  = / yf (y ) dy 
— 00 


which  defines  the  level  about  which  it  fluctuates, 
stant  variance 


and  a con- 


00 

a2  = E [ (y fc-y ) 2 ] = j (y-y)2f(y)dy 

— CO 

which  measures  its  variability  about  the  mean  level.  Since 
the  probability  distribution  f(yfc)  is  the  same  for  all  times 
t,  the  mean  and  the  variance  can  be  estimated  by  the  averages 
taken  over  time: 


y = \ Z Yt  and  52  = £ Z (yt-y)2 

t=i  t=i 

The  stationarity  assumption  also  implies  that  the  bi- 
variate distribution  f (y  , y ) is  the  same  for  all  times 

12 

t^  and  t2  such  that  is  constant.  The  autocorrelation 

at  lag  k,  p^,  is  defined  as: 


e t (yt-y) (yt+k-y 1 


This  is  estimated  by  the  time  average: 

N-k 

t=l 
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The  plot  of  the  autocorrelation  function  vs.  the  lag  k, 
called  the  correlogran,  is  very  useful  for  the  purpose  of 
determining  if  a process  is  stationary  and  for  identifying 


the  appropriate  model. 

Another  function  which  is  important  for  purposes  of 

identifying  the  appropriate  linear  time  series  model  is  the 

partial  autocorrelation  function.  Let  0 = b + b.  yx  ,+  b_' 

e t o 1 t+1  2 

y „+...+  b,  , y , where  the  b's  are  the  least  squares 
•‘t+2  k-1  t+k-1 

estimates  of  the  linear  regression  coefficients  (B's)  in 
the  model 


yt  = So  + 6lyt+l  +-'*+  8k-lyt+k-l  + et 


Let  be  the  residual  of  yfc  after  removing  the  linear  effect 
of  yt+l'--*yt+k-l  fr°m  yt?  i*e‘ 


2t  = yt  " yt 


The  partial  autocorrelation  of  lag  k,  denoted  ({i^/is  defined 
to  be  the  simple  correlation  of  lag  k for  the  adjusted  series 

Z-ir  z0/...  . 

X 4m 

B.  AUTOREGRESSIVE-MOVING  AVERAGE  MODELS 

The  general  class  of  models  postulated  by  Box  and  Jenkins 
for  stationary  time  series  is  the  class  of  linear  models 
defined  by: 


yt  = Vt-l  + <»2yt-2+---+Vt-p  + eo  + at-6iat-l-02at-2- 


■0  a 
q t-q 
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where  {at}  is  a sequence  of  observations  from  a white  noise 

2 

process  (E[a  ] = 0,  var[a  ] = a and  E [a  a ] = 0 for  all 

tH  U cl  t t *r  K 

k > 0)  and  0 1 , . . . , 0p , 0Q,  are  p + q+1  parameters  that 

are  to  be  estimated  from  the  data.  The  model  above  is 

called  a mixed  autoregressive-moving  average  (ARMA)  model  of 

order  (p,q).  If  q=0,  the  model  is  called  an  autoregressive 

model  of  order  p,  AR  ; and  if  p=0,  the  model  is  called  a raov- 

P 

ing  average  model  of  order  q,  MA^.  Thus,  the  general  linear 
model  of  Box-Jenkins  represents  the  current  observation  y 
as  a weighted  sum  of  past  observations  and  present  and  past 
random  shock  terms.  The  model  is  usually  expressed  in  abbre- 
viated form  using  operator  and  transfer  function  notation: 

<MB)y.  = 0 + 0 (3)  a (2) 

to  t 

where  d>  ( B ) = 1-0  , B — <+J  _B  2- . . . -0  BP,  0(B)  = 1-0.B-...-0  and 

12  p 1 q 

B is  the  operator  defined  by  B^  = yfc  In  order  to  guaran- 

tee stationarity  it  is  necessary  that  the  autoregressive 
parameters  satisfy  certain  conditions.  The  conditions  can 
be  summarized  by  stating  that  all  roots  of  the  polynomial 
equation  0(B)  = 0 (treating  B as  a dummy  variable)  must  lie 

outside  the  unit  circle. 

The  tentative  identification  of  the  appropriate  member  of 
the  general  class  (the  identification  of  p and  q)  is  accom- 
plished by  comparing  the  sample  autocorrelation  and  partial 
autocorrelation  functions  of  the  given  time  series  with  the 
theoretical  autocorrelation  and  partial  autocorrelation  func- 
tions of  members  of  the  general  linear  class.  For  most 
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stationary  time  series  an  adequate  fit  can  be  found  in  a 
model  with  p and  q relatively  small,  say  three  or  less. 

C.  HOMOGENEOUS  NONSTATIONARY  SERIES 

In  many  cases  the  time  series  of  interest  is  not  station- 
ary. Instead,  the  probabilistic  structure  of  the  process 
which  generates  the  time  series  may  change  with  time.  For 
example,  there  may  be  some  sort  of  trend  or  seasonal  pattern 
in  the  time  series.  If  the  process  does,  nevertheless, 
exhibit  behavior  which  is  somewhat  homogeneous  then  the 
original  time  series  can  often  be  transformed  into  a station- 
ary series  that  can  be  described  by  an  ARMA  model. 

A time  series  is  said  to  be  homogeneous  nonstationary  of 
order  d if  w = Ady,_  is  a stationary  series.  Here  A denotes 
the  differencing  operator: 

Ayt  = yt  " yt-l  = (1“B)yt 

» Jc  . , . k- 1 . 

A y t = A(A  yfc) 


* 

* 


That  a time  series  is  nonstationary  is  indicated  by  a plot 
of  the  time  series  itself  (e.g.  a nonconstant  mean)  and  by 
the  autocorrelation  function.  Characteristic  of  the  correlo- 
gram  of  a nonstationary  series  is  the  very  slow  damping  out 
of  the  autocorrelation.  When  this  property  of  the  correlo- 
gram  is  observed,  the  user  should  difference  the  series  one 
time  and  compute  the  correlogram  for  the  series  wfc  = Ayfc. 

The  user  should  continue  to  difference  the  series  until  the 
resulting  series  appears  stationary  or  until  the  procedure 
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appears  not  to  improve  the  series.  The  transformed  series 
= Adzfc  is  then  modeled  as  an  ARMA  model.  The  resulting 
model.,  in  terms  of  the  original  time  series  is: 

4)  (B)  Vdyfc  = 0q  + 9 (B)  at  . (3) 

In  this  form  of  the  model,  the  transfer  function  <J)(B)  is 
assumed  to  be  stationary;  i.e.,  all  roots  of  tj>(B)  = 0 are 
outside  the  unit  circle.  It  is  sometimes  written  in  the 
equivalent  form 

iMB)yfc  = 0Q  + 9 (B)at  (4) 

where  ip(B)  = 4>(B)V  = <j>(B)  (1-B)  and,  clearly,  ip  (B)  is  not 

a transfer  function  of  a stationary  series  (<|;(B)  = 0 has  d 

roots  on  the  unit  circle) . The  ARMA  model  for  the  differ- 
enced series  is  called  an  autoregressive  integrated  moving 
average  ( ARIMA ) model  of  order  (p,d,q).  For  the  purpose  of 
distinguishing  between  the  two  forms  of  the  ARIMA  model, 
equation  (3)  is  referred  to  as  the  differenced  form  and  (4) 
as  the  undifferenced  form. 

D.  SEASONAL  TIME  SERIES 

Seasonality  is  defined  as  cyclical  behavior  that  occurs 
on  a regular  calendar  basis.  For  example,  a highly  seasonal 
time  series  would  be  the  sales  of  Christmas  ornaments  which 
exhibit  a strong  peak  every  December.  Rainfall,  crop  yields 
livestock  production,  energy  cjnsumption,  and  many  other 
time  series  that  are  influenced  by  the  weather  all  exhibit 
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seasonal  patterns.  Seasonal  patterns  are  often  easy  to  spot 
simply  by  observing  the  time  series  directly.  However,  many 
times,  if  the  variability  in  the  time  series  is  large, 
seasonal  patterns  will  not  be  distinguishable  from  the  other 
fluctuations.  Recognition  of  seasonality  is  important  since 
it  provides  information  that  can  aid  in  modeling  and  fore- 
casting. The  autocorrelation  function  makes  recognition  of 
seasonal  patterns  easier. 

Suppose,  for  example,  that  a monthly  series  has  an  annual 
seasonal  pattern.  Then  the  realizations  should  show  some 
special  correlation  with  other  realizations  which  lead  or  lag 
by  12  months;  i.e.,  there  should  be  some  correlation  between 
y t and  y t+12 / yt  and  yt+24/  yfc  and  yt+36/  etc.  These  corre- 
lations should  manifest  themselves  in  the  autocorrelation 
function  which  should  show  peaks  at  k = 12,  24,  36,  etc. 

The  Box-Jenkins  modeling  approach  for  seasonal  (nonsta- 
tionary) time  series  is  to  first  transform  the  seasonal  series 
to  a new  series  which  is  stationary.  This  can  often  be 
accomplished  by  taking  seasonal  differences  A^yfc  defined  as 
follows ; 


Ay  = y 
s*t 


y 


t-s 


( 1-BS ) y t 


Asyt  = As(As"lyt) 


The  transformed  time  series  {w^}  (wfc  = Ay)  is  then  analyzed 
as  a stationary  time  series.  (It  may  be  necessary  to  perform 


more  than  one  seasonal  and/or  differencing  transformation 
before  the  resulting  series  is  stationary.)  Suppose  that 
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the  resulting  ARMA  model  for  the  transformed  series 
is 

$ ( B ) w = 9 + 0 (B) a 

to  t 

The  model  for  the  original  series  is  then 

(B)  (1-BS)  dy^  = 9q  + 0 (B)  a . 

E.  PARAMETER  ESTIMATION 

Suppose  the  series  has  been  tentatively  identified  as 
an  ARIMA  (p,d,q)  model: 

<P  (B)  Adyt  = 0Q  + 0 (B)  at  (5) 

where  <J>(B)  = 1 — <J) . B- . . . -<j)  B^  and  0(B)  = 1-0  B-...-0  B*^  • There 

IP  1 q 

2 

are  p + q+2  unknown  parameters,  { $ 1 , . . . , $ , 9q , 0 1 . . , 0 , aa  }, 

that  must  be  estimated.  The  Box-Jenkins  procedure  separates 

the  estimation  problem  into  two  parts.  First  estimates  are 

obtained  for  the  autoregressive-moving  average  parameters 

2 

<p  and  0,  and  the  estimates  are  then  made  of  a and  0 which 

a.  o 

are  functions  of  the  ARMA  parameters.  The  usual  procedure 

• A >S 

is  to  select  those  parameter  values  <£_  and  0.  that  minimize  the 

sum  of  squared  errors.  Let  y =0  / ( l-d> , - . . . -<b  ) and 

w o T1  p 

wfc  = Adyfc  - Then,  it  can  be  easily  shown  that  y^  = E[wt] 

and  that  (5)  can  be  rewritten  as: 

4>  (B)  (»t-yw)  = 9 (B)  afc 

or  a =0-1(B)<MB)(w-y) 

t t w 
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Let  w,  the  sample  mean,  be  the  estimate  of  p (if  d > 0,  p 

w w 

is  usually  0)  and  let 

afc  = 0 1(B)4>(B)  (wt~w)  • (6) 

Let  S($,§)  = Z a 2 . 

t t 

A /V 

The  objective  is  to  select  those  parameters  <p  and  0 that  mini- 

A A 

mize  S(6,0).  Since  the  equation  S is  nonlinear  in  the  para- 
meters, iterative  search  methods  must  usually  be  used  in  the 

2 

minimization.  An  estimate  of  a is  provided  by 

/v  2 ~ /•> 

0 = S ( <p  0 ) /n-p-q 

3 — 

F.  DIAGNOSTIC  CHECKING 

After  the  model  has  been  tentatively  identified  and  para- 
meter estimates  have  been  obtained,  the  next  task  is  to  test 
whether  or  not  the  original  specification  was  correct  and 
the  motel  is  adequate.  The  process  of  testing  the  model 
takes  many  forms,  but  usually  involves  at  least  the  following 
two  steps : 

1.  Generate  a simulated  series  from  the  estimated  model 
and  compare  the  simulated  series  and  its  autocorrela- 
tion functions  with  the  original  series  and  its 
respective  autocorrelation  and  partial  autocorrela- 
tion functions.  The  comparison  is  primarily  subjec- 
tive. 
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2.  Compute  the  residuals  of  the  estimated  model  from 

(6)  and  compare  the  properties  of  the  residuals  with 
those  assumed  for  the  shock  terms  of  the  actual  pro- 
cess. The  residuals  should  be  normally  distributed 
and  uncorrelated  with  each  other.  There  are  many 
quantitative  statistical  tests  that  can  be  used  to 
test  hypotheses  about  normality  and  zero  correlation. 

A plot  of  the  autocorrelation  and  partial  autocorrela- 
tion functions  of  the  residuals  provides  not  only  a test  of 
whether  or  not  the  residuals  are  uncorrelated,  but,  if  they 
are  correlated,  the  plots  suggest  modifications  to  the  model. 
For  example,  suppose  the  model  was  tentatively  specified  as 
the  ARMA (1,1)  model:  (1-0. 5B)  (y  -2)  = (l+0.7B)at  and  the 

autocorrelations  and  partial  autocorrelations  of  the  resid- 
uals suggest  the  model 

( 1-0 . 3B) a = u 

where  the  u's  are  white  noise  (uncorrelated  with  variance 
2 

) . Then,  the  next  model  entertained  for  the  original 
time  series  should  be  the  ARMA(2,1)  model: 

(1-0.3B)  (1-0.5B)  (yt-2)  = (l  + 0.7B)ut  . 

G.  FORECASTING 

The  objective  of  forecasting  is  to  predict  future  values 
with  as  little  error  as  possible.  The  criterion  most  often 
used  for  selection  of  the  best  forecast  is  that  forecast 
which  has  minimum  mean  square  forecast  error.  Thus,  if 
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yT(£)  represents  the  forecast  at  origin  T of  the  value  y , 

A 

the  objective  is  to  select  yT(Jt)  so  that 
E [ (yT+Jl  ’ ?tU))2] 

A 

is  minimized.  This  forecast  is  given  by  taking  yT(&)  as  the 
conditional  expectation  of  yT+£: 

yTU)  = E [yT+£  i yT'yT_!' • • * '^i] 

The  forecasts  can  be  easily  generated  recursively  from  the 
mathematical  model  utilizing  the  fact  that 


E[yT_^]  = yT_j  for  j=0,l,2  (T  is  current  time) 


and 


0 if  t > T 


E [afc] 


a if  t £ T 


For  example,  suppose  the  estimated  model  is 


(1-0. 5B  + 0.6B  ) y t = (l+0.3B)at 


or,  equivalently. 


Yt  = °-5Yt-l  - °*6yt-2  + at  + °*3at-l 
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r 


where  y1Q0 

= 1.4,  y99 

= 

1.0,  and  a100  = 0.2.  The  forecasts 

of  yioi'  y 

102'  and  y!03 

made  at  time  t = 100  are  found  as 

follows : 

yioo { 1) 

= 

E ^yl01 1 yi00 ' y99 ' ’ ‘ ‘ ' yl  ^ 

= 

E[°'5yi00-°'6y99+ai01+0-3ai001 

— 

°-5yi00  “ °*6y99  + ° ’ 3 a 1 0 0 

yioo  (1) 

= 

0 . 16 

yioo  (2) 

= 

Etyl02lyioi,ylOO''**'yi] 

= 

E[0-5yi01-0-6yi00+ai02+0*3ai01] 

= 

°*5yioo(1)  - °-6yl00 

yioo  {2) 

= 

-0.76 

yioo  (3) 

= 

Ef0'5yi02-°-6yi01+ai03+0*3ai02] 

= 

°*5yioo(2)  - °*6yioo{1) 

= 

-0 . 43 

Let  e^ ( l ) 

A 

yT+£  ~ Y T 

U) 

be  the  forecast  error  Z periods 

ahead.  It 

can  be  shown 

that  eT(£)  is  given  by 

eTU)  = 

where  the 

weights  \jj 

are 

determined  from 

ip  (B)  = 

4>" 

1 (B)  (1-B) ~d9  (B)  . 

The  variance  of  the  forecast  error  is  given  by 


E[eT2U)]  = (1  + 4j12+...+  4^  ^ _ -l  ) cra  2 (9) 
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From  this,  a confidence  interval  of  z standard  deviations 
around  a forecast  Z periods  ahead  would  be  given  by 


Z-l 

C (y_U>>  = y_U>  + z ( 1 + ^V72  a (10) 

2 T T : = 1 3 

Note  from  expression  (8)  that  the  one-step  ahead  forecast 
error,  e^l),  is  simply  aT+1,  i.e. 


yT+l  ' yTU)  = aT+l 


This  explains  the  common  use  of  the  word  residual  to  refer 
to  the  random  shock  terms.  Also,  from  expressions  (9)  and 
(10)  it  is  clear  that  the  forecast  error  variance  is  a non- 
decreasing function  of  the  length  of  the  forecast  period  Z . 
Thus,  the  confidence  bands  must  get  wider  as  the  forecast 
period  gets  larger. 
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DESCRIPTION  OF  THE  TIME  SERIES  EDITOR 


In  this  chapter,  descriptions  are  given  of  each  program 
in  the  Time  Series  Editor  that  interacts  with  the  user. 

There  are  eight  separate  program  modules  that  are  currently 
included  in  the  Editor.  In  addition,  there  are  a few  other 
programs  that  reside  in  the  Editor,  but  which  are  completely 
transparent  to  the  user.  The  latter  programs  serve  useful 
functions  in  conjunction  with  the  other  eight  modules,  but 
they  are  not  described  here. 

No  attempt  is  made  to  describe  the  actual  mathematical 
calculations  or  algorithms  that  are  performed  by  the  programs. 
Rather,  the  objective  is  to  give  general  descriptions  about 
what  each  program  can  do  for  the  user  and  how  the  user  in- 
teracts with  the  programs.  The  eight  programs  described  in 
this  chapter  are  TIMESER  EXEC,  PLOT,  DIFF,  AUTO,  ESTIMATE, 
FORECAST,  SIMULATE  and  GENERATE. 

A.  THE  EXECUTIVE  PROGRAM 

The  Time  Series  Editor  contains  a master  program,  called 
TIMESER  EXEC, that  provides  file  control  of  all  of  the  other 
programs,  controls  input  and  output,  takes  care  of  the  neces- 
sary CP/CMS  protocol  and  provides  instructions  to  the  user 
as  to  what  is  in  the  Time  Series  Editor  and  how  each  program 
can  be  used.  TIMESER  is  written  in  a special  CP/CMS  Exec 
language.  It  is  the  only  program  in  the  Editor  that  is  not 
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written  in  FORTRAN,  and,  consequently,  it  should  be  the  only 
program  that  would  need  modification  if  the  Editor  were  to 
be  adapted  to  another  time  sharing  system. 

After  the  user  has  logged  into  CP/CMS  and  linked  to  the 
files  containing  the  Time  Series  Editor  (a  user's  guide  for 
this  is  given  in  Appendix  A)  the  entire  Editor  package  is 
made  available  to  the  user  by  his  command,  TIMESER  EXEC.1 
On  entry  of  this  command,  the  EXEC  provides  a guided  tour 
through  the  Editor.  It  tells  the  user  what  the  Editor  can 
provide;  it  asks  the  user  what  tasks  he  wants  to  do;  and,  on 
the  basis  of  the  user's  answers,  it  instructs  the  user  as  to 
what  data  is  required  and  how  it  must  be  entered.  When  the 
user  selects  an  option  for  execution,  the  EXEC  loads  the 
appropriate  program(s)  and  automatically  manipulates  any  re- 
quired input  and  output  files. 

B.  DATA  ENTRY  AND  PROGRAM  OUTPUT 

Whenever  data  is  required,  the  user  is  prompted  by  either 
the  EXEC  or  the  program  module  being  executed.  In  most  cases, 
the  necessary  user  response  is  a single  alphanumeric  charac- 
ter input  during  execution  by  keyboard.  However,  in  some 
cases,  the  amount  of  data  required  is  too  bulky  for  keyboard 
entry,  and  the  data  is  entered  more  efficiently  offline  via 
cards  or  tapes.  Similarly,  most  of  the  output  is  typed  out 

1There  is  a second  executive  routine,  called  TS  EXEC, 
that  can  be  used  by  the  more  experienced  analysts  who  wish 
to  suppress  some  of  the  user  instructions.  This  abbreviated 
program  provides  the  same  basic  services  as  TIMESER  EXEC. 
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right  at  the  user's  terminal,  but,  in  some  cases,  the  output 
is  printed  offline  for  conservation  of  time  and  to  provide 
a hard  copy  of  the  results. 

Detailed  descriptions  are  given  of  the  input  and  output 
requirements  of  each  program  in  the  individual  program  write- 
ups. However,  there  are  some  general  principles  of  data 
input  that  apply  for  all  programs.  These  are  described  in 
this  section. 

1 . Offline  Card  Input 

When  a large  volume  of  data,  such  as  a time  series 
of  a hundred  or  more  observations,  is  required,  the  data  can 
be  entered  more  efficiently  through  mechanisms  other  than 
keyboard  entry.  Keyboard  entry  would  be  not  only  much 
slower,  but  also  more  likely  to  contain  errors  than  other  in- 
put mediums.  Thus,  the  Time  Series  Editor  requires  that  the 
time  series  data  be  entered  offline  via  cards.  The  data  are 
read  offline  and  stored  in  the  user's  file  FT02F001  which  is 
read  automatically  when  required  by  the  Time  Series  Editor. 

An  example  of  an  input  data  deck  for  use  in  the  Editor  is 
shown  in  Figure  2. 
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2 . Numeric  Keyboard  Input 

When  the  user  is  prompted  to  enter  numerical  values 
such  as  the  number  of  observations,  the  number  of  parameters, 
parameter  estimates  or  starting  conditions,  he  must  enter 


those  values  before  program  execution  can  continue.  The  user 
should  enter  the  data  according  to  standard  FORTRAN  practice. 
That  is,  integer  data  should  be  entered  (without  decimal)  for 
counts  and  names  beginning  with  the  letters  I through  N;  float- 
ing data  should  be  entered  with  decimal  point  for  all  other 


variables.  Because  a typed  decimal  point  overrides  a float- 
ing format  it  is  not  necessary  for  the  user  to  concern  himself 
with  the  format  for  floating  data  (the  user  is  never  asked  to 
enter  more  than  a single  observation  on  a line)  . However, 
some  care  must  be  exercised  when  entering  integer  data  because 
integer  data  must  be  right  justified  in  its  format  field. 

The  user  is  told  whenever  the  integer  format  is  anything  other 
than  II.  For  example,  suppose  the  user  wants  to  analyze  a 
time  series  having  only  65  observations.  If  the  program  that 
he  is  executing  requires  the  length  of  the  time  series,  the 
user  will  receive  the  following  request: 


ENTER  LENGTH  OF  THE  TIME  SERIES,  L,  VIA  13. 
The  user  should  then  enter: 


col.  123 

b65  (b  represents  a blank  space) 

If  the  blank  were  omitted,  the  program  would  read  the  length 
as  650  and  many  problems  would  occur. 
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Alphabetic  Keyboard  Input 


The  Time  Series  Editor  often  requests  the  user  to 
respond  with  alphabetic  input.  For  example,  it  may  ask  him 
a question  that  requires  a yes  or  no  answer,  or  it  may  ask 
him  what  option  he  wants  to  execute.  The  editor  has  been 
programmed  to  read  only  the  first  letter  of  the  user's  re- 
sponse. Thus,  he  need  only  enter  a single  letter  for  each 
such  inquiry.  For  example,  he  should  enter  Y for  yes,  N for 
no,  P for  the  PLOT  option,  E for  the  ESTIMATE  option,  etc. 

4 . Output 

Most  of  the  results  are  written  out  right  at  the 
user's  terminal.  In  some  cases,  however,  the  output  is  prin- 
ted offline  to  conserve  time.  Such  results  as  plots  and 
transformed  time  series  are  written  onto  various  files 
(FT03F001 , FT08F001 , FT01F001)  and  the  EXEC  program  prints 
them  offline  under  the  user's  identification  number.  The 
files  can  also  be  printed  out  at  the  user's  terminal  at  his 
request . 

C.  PLOT  PROGRAM 

The  PLOT  program  plots  any  given  time  series  which  re- 
sides in  file  FT02F001.  Other  than  the  time  series,  which 
is  entered  offline,  the  program  requires  only  that  an  identi- 
fication title  for  the  plot  be  entered  by  the  user  during 
execution.  The  plot  is  automatically  printed  offline  and  is 
also  found  in  file  FT08F001.  The  PLOT  program  uses  the  sub- 
routine PLOTP  in  the  IBM  Scientific  Subroutine  Package 
Library  (SSPLIB) . 
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DIFF  PROGRAM 


T 


D . 


The  DIFF  program  performs  seasonal  and/or  nonseasonal 
differences  and  it  allows  the  user  to  transform  a given  time 
series  using  logarithmic  or  exponential  transformations.  The 
program  requires  the  given  time  series  to  reside  in  file 
FT02F001,  and  it  requires  the  user  to  input  the  following 
information  during  program  execution: 

(1)  Type  of  time  series  (seasonal,  nonseasonal) 

(2)  Order  of  differencing  (for  stationarity) 

(3)  Length  of  seasonal  period  (for  seasonal  series  only) 

(4)  Type  of  transformation  (none,  exponential, 
logarithmic ) 

(5)  Transform  parameters  (if  log  or  exponential  trans- 
form is  desired) 

(6)  Yes  or  no  responses  to  questions  about  plotting  of 
autocorrelations  and  partial  autocorrelations  of 
transformed/differenced  series. 

The  transformed/differenced  series  is  written  out  into  file 
FT03F001  and  can  be  printed  out  at  the  user's  terminal  if 
he  requests  a printout.  Plots  of  the  transformed/differenced 
series,  its  autocorrelations,  and  its  partial  autocorrela- 
tions can  be  printed  offline. 

This  program  is  use'd  to  transform  a given  series  that  may 
be  seasonal  and/or  nonstationary  or  nonhomogeneous  into  a 
stationary  series  of  the  type  possible  for  analysis  using 
the  Box-Jenkins  methodology.  The  program  utilizes  the  IMSL 
subroutine,  FTRDIF. 


E.  AUTO  PROGRAM 

AUTO  calculates  summary  statistics  from  a given  time 
series.  The  summary  statistics  include  the  sample  mean. 


33 


variance,  autocorrelations,  and  partial  autocorrelations  of 


logs  one  through  20.  All  of  the  statistics  are  printed  out 
at  the  user's  terminal,  and,  in  addition,  plots  of  the  auto- 
correlations and  partial  autocorrelations  are  printed  out 
offline.  Furthermore,  the  plots  reside  in  the  user's  file 
FT08F001  so  that  they  may  also  be  printed  at  the  user's  ter- 
minal. That  is,  however,  a very  time  consuming  process. 

AUTO  provides  the  major  information  about  s tationar i ty , 
seasonality  anc  model  identification.  With  the  Box-Jenkins 
procedure,  the  second  moments  (autocorrelations  and  partial 
autocorrelations)  are  the  primary  tools  for  tentative  model 
identification.  In  addition,  AUTO  is  used  in  the  diagnostic 
checkout  phase  of  model  building  to  test  the  residuals. 

The  time  series  (original,  transformed  or  differenced, 
or  residuals)  must  reside  in  file  FT02F001.  (When  a series 
is  transformed  or  the  residuals  are  calculated,  those  values 
are  automatically  stored  in  file  FT02F001  temporarily  for 
further  analysis.)  The  program  uses  the  IMSL  subroutine 
FTAUTO . 

F.  ESTIMATE  PROGRAM 

After  the  user  has  tentatively  identified  a model,  the 
Editor  program  ESTIMATE  should  be  executed  to  calculate  maxi- 
mum likelihood  estimates  of  the  model  parameters.  It  esti- 
mates the  autoregressive  parameters,  the  moving  average 
parameters,  the  constant  term  and  the  variance  of  the  shock 
terms.  It  also  determines  the  residuals  which  are  so  impor- 
tant for  the  diagnostic  checkout  phase  of  model  building. 
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ESTIMATE  uses  the  IMSL  subroutine  FTMAXL  to  estimate  the 
parameters . 

ESTIMATE  needs  the  time  series  in  file  FT02F001  prior  to 
execution.  During  program  execution  the  user  will  be  prompted 
with  the  following  requests: 

1)  Enter  number  of  autoregressive  (AR)  parameters. 

2)  Enter  number  of  moving  average  (MA)  parameters. 

3)  Enter  the  number  of  differences. 

ESTIMATE  then  provides  the  following  output: 

1)  Estimated  AR  parameters  for  the  undifferenced  form 
of  the  model. 

2)  Estimated  MA  parameters. 

3)  Estimated  overall  MA  constant  and  white  noise  variance. 

4)  Auto-  and  partial  correlations  of  residuals. 

5)  Plots  of  ( 4) . 

6)  Values  of  residuals  printed  out  offline  and  residuals 
stored  in  file  FT03F001. 

7)  Chi-square  goodness  of  fit  value  for  estimated  model. 

G.  FORECAST  PROGRAM 

The  Editor  program,  FORECAST,  uses  the  estimated  mathe- 
matical model  to  compute  forecasts  of  future  values  of  the 
time  series.  It  also  computes  (1-a)  100%  probability  limits 

for  the  forecasted  values.  The  program  utilizes  the  IMSL 
subroutine  FTC AST . 

The  time  series  must  reside  in  file  FT02F001  before  exe- 
cution begins.  During  execution,  the  user  is  required  to 
enter  the  following  inputs: 
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1)  Origin  of  forecasts 

2)  Number  of  AR  parameters 

3)  Estimated  values  of  AR  parameters 

4)  Number  of  MA  parameters 

5)  Estimated  values  of  MA  parameters 

6)  Overall  MA  constant  and  white  noise  variance 

7)  Maximum  lead  time  for  a forecast 

8)  Order  of  differencing  in  model 

9)  Significance  level  for  forecast  confidence  limits 

The  program  then  provides  the  following  output: 

2 

1)  AR  parameters  in  undifferenced  form 

2)  Forecasts  for  lead  times  i = 1,2,..., max 

3)  Deviations  from  each  forecast  for  the  (1-ct)  100% 

confidence  limits 

4)  Plots  of  forecasts  and  corresponding  deviations  joined 
with  the  original  time  series.  This  is  plotted 
offline  . 

H.  SIMULATE  AND  GENERATE  PROGRAMS 

Because  of  their  similarities  the  Editor  programs  SIMULATE 
and  GENERATE  are  described  together.  GENERATE  allows  the  user 
to  generate  a time  series  from  any  ARIMA  model  that  he 
specifies.  The  user  must  identify  the  model  and  give  values 
for  its  parameters  and  starting  conditions.  The  program  takes 
the  specified  model,  generates  random  noise  terms,  and 
calculates  as  many  values  of  the  time  series  as  desired. 


2 

Suppose  the  identified  model  was  ARIMA  (1,1,0).  The 
differenced  form  is  ( 1 -<$  ^ B ) Ay  ^ = 9^  + afc.  The  undifferenced 

form  is  ( ]_  _ ( i + <£  ^ ) b + <}>  ^ B2  ) y ^ = 9q  + a found  by  multiplying 
(1-4>1B)  by  (1-B)  . 
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This  program  is  useful  for  purposes  of  classroom  instruction 
for  the  generation  of  a wide  variety  of  time  series  examples 
for  model  identification.  It  is  also  useful  for  the  diag- 
nostic phase  of  model  checkout.  A time  series  can  be 
generated  from  the  estimated  model  and  its  properties  can  be 
compared  with  those  of  the  original  series.  If  large  dis- 
crepancies occur  the  estimated  model  may  be  inadequate. 

The  program  requires  the  user  to  enter  the  following  in- 
puts during  execution: 

1)  A randum  number  seed 

2)  Number  of  AR  parameters  (undifferenced  form) 

3)  Number  of  MA  parameters 

4)  Length  of  time  series 

5)  White  noise  variance 

6)  Values  of  AR  and  MA  parameters 

7)  Initial  starting  values 

The  program  output  is  the  generated  time  series. 

The  SIMULATE  program  provides  the  capability  of  generat- 
ing any  number  of  simulated  time  series'.  This  is  useful  for 
predicting  what  might  happen  in  the  future  and  to  demonstrate 
that,  even  within  a given  model,  the  actual  observed  time 
series'  can  differ  substantially.  This  program  uses  GENERATE 
but  also  requires  as  input  the  number  of  simulated  series  the 
user  wishes  to  generate.  Furthermore,  the  SIMULATE  program 
allows  the  user  to  select  values  of  the  original  time  series 
as  starting  values  for  the  simulated  series'.  Output  consists 
of  the  simulated  series'  and  plots. 
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This  completes  the  descriptions  of  the  programs  contained 


5 


in  the  Time  Series  Editor.  The  actual  program  listings  are 
given  in  Appendix  C and  sample  user  sessions  are  shown  in 
Appendix  B.  In  the  next  chapter,  an  example  is  given  of  an 
entire  time  series  analysis  from  plotting  to  diagnostic  test- 
ing and  forecasting. 


IV.  EXAMPLE  TIME  SERIES  ANALYSIS 


In  this  chapter  a description  is  given  of  a complete 
analysis  of  a time  series  using  the  Box-Jenkins  procedure 
and  the  Time  Series  Editor.  The  time  series  analyzed  is 
series  C (Chemical  Process  Temperature  Readings)  from  Box 
and  Jenkins  [Ref.  2,  p.  528] . This  time  series  was  selected 
because  it  is  analyzed  completely  in  Ref.  2 so  that  the 
user  can  compare  the  results  found  there  with  the  results 
given  by  the  Time  Series  Editor. 

The  first  step  in  the  analysis  of  series  C is  to  plot 
the  time  series.  This  is  shown  in  Figure  3.  The  plot 
reveals  rather  wide  fluctuations  in  the  series  but  not  the 
sort  of  explosive  nonstationary  behavior  that  would  render 
a modeling  attempt  fruitless.  The  plot  also  reveals  that 
the  time  series  has  a large  amount  of  momentum  (movements 
of  the  series  tend  to  resist  changes  of  direction) . This 
is  characteristic  of  ARIMA  models  with  one  or  two  differences. 

Second,  the  autocorrelations,  partial  autocorrelations, 
mean,  and  variance  of  the  series  were  estimated  using  AUTO. 

The  numerical  values  are  printed  out  in  Table  I and  plots 
of  the  autocorrelations  and  partial  autocorrelations  are 
given  in  Figures  4 and  5.  Figure  4 shows  that  the  autocor- 
relations dampen  out  slowly  in  a near  linear  fashion.  This 
is  an  indication  that  the  series  is  nonstationary  and  that 
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one  or  more  differences  are  needed  t 
The  partial  autocorrelation  plot  is 
the  autocorrelations  fail  to  dampen 

As  suggested  by  the  plots  of  original  series  and  its 
autocorrelations,  the  program  DIFF  was  executed  to  trans- 
form the  series  { y t } to  the  series  { } where  w^  = Ay  = 
yfc  - yfc_1.  The  values  of  the  series  {wt)  are  tabulated  in 
Table  II.  The  autocorrelations  and  the  partial  autocorre- 
lations of  the  differenced  series  were  then  calculated  and 
plotted.  The  values  are  given  in  Table  III  and  the  corre- 
lation plots  are  shown  in  Figures  6 and  7.  The  correlation 
plots  suggest  that  the  first  differenced  series  is  either 
ARMA (1,0)  with  the  AR  parameter  near  unity  (the  autocorre- 
lations of  the  first  differences  also  dampen  out  slowly) 
or  a second  difference  is  required.  Thus,  two  candidates 
are  suggested: 

1)  ARIMA (1,1,0)  : ( l-$ x B ) ( 1 -B ) y = 9q  + a 

and  2)  ARIMA ( 0 , 2 , 0 ) : (1-B)2y  = 9 + a . 

t o t 

For  purposes  of  estimation,  the  second  model  was  extended  to 
include  two  moving  average  terms.  Such  " over f i t t ing"  is 
often  done  to  see  if  the  estimated  moving  average  parameters 
turn  out  to  be  near  zero,  thus  confirming  the  tentative 
identification.  Thus,  the  two  models  entertained  were: 

1)  ( l-(}>  B)  Ay  =9  + a^  ARIMA(1,1,0) 

i t o t 

2 2 

and  2)  A yfc  = 9Q  + (1-0^  - Q 23  ) afc  ARIMA(0,2,2) 

The  next  step  is  to  calculate  maximum  likelihood  esti- 
mates of  the  model  parameters.  The  program  ESTIMATE  was 
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used  to  do  this.  The  estimated  parameters  for  the  ARIMA 


(1,1,0)  are : 

$ = 0.8131 

9 =0.0 

o 

C 2=  0.0178396 
a 

The  autocorrelation  and  partial  autocorrelations  of  the 
residuals  were  calculated  and  plotted  to  test  the  model. 

The  correlations  should  appear  to  be  estimates  of  a pure 
white  noise  process  if  the  model  is  adequate.  The  model 

passed  that  test.  A goodne s s -o f - f i t test  was  also  performed 

2 

to  test  the  model.  The  chi-square  value  was  X = 21.51  with 
19  df  and  a significance  level  of  0.3082.  Thus,  there  is 
no  strong  evidence  to  suggest  that  the  (1,1,0)  model: 

(1-1.81318  + 0.8131B2)yt  = a (11) 

is  inadequate . 

Parameter  estimates  were  also  obtained  for  the  ARIMA 
(0,2,2)  model  using  ESTIMATE.  The  parameter  estimates  were: 

0 =0.0 

o 

9 = 0 .1382 

02  = 0.1300 

0 2=  0.0189515 
a 
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As  before,  the  correlation  plots  of  the  residuals  fail  to 
suggest  any  inadequacy  of  the  model.  However,  the  chi- 
square  lack  of  fit  test  yielded: 

2 

X = 28.74  with  18  df  and  a significance  level  of  0.0516. 

Thus,  if  the  ARIMA(0,2,2)  model  were  the  correct  model  a 
chi-square  value  as  large  as  28.74  would  occur  by  chance 
with  a probability  of  only  0.0516.  There  is  some  ground 
here  for  questioning  the  ARIMA(0,2,2)  model.  Because  of  its 
simplicity  (parsimony  is  a very  desirable  feature  for  all 
models)  and  its  better  fit,  the  ARIMA(1,1,0)  model,  equation 
(11),  was  selected. 

The  estimated  model,  equation  (11),  for  series  C was 
used  to  calculate  forecasts  and  confidence  limits  for  those 
forecasts.  The  forecasts  were  made  for  13  periods  into  the 
future  at  origin  t=20  using  the  observed  values  of  the 
original  time  series  as  starting  conditions.  Figure  8 shows 
a plot  of  the  forecasted  values  adjoined  to  the  original 
series  with  80%  confidence  limits  about  those  forecasts. 

The  forecasted  values  and  the  probability  deviations  are 
tabulated  on  page  59.  The  plot  shows  that  the  confidence 
limits  are  very  wide  for  lags  far  into  the  future. 

Finally,  program  SIMULATE  was  utilized  to  generate  two 
simulated  series  from  eq.  (11).  Values  of  these  two  simula- 
tions are  given  in  Table  IV.  Plots  of  those  simulated 
series  are  shown  in  Figure  9.  The  general  shape  of  those 
curves  is  like  that  of  the  original  time  series,  thus  con- 
firming the  estimated  model. 
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The  keyboard  printout  of  the  user  session  that  generated 
the  analysis  above  is  included  in  Appendix  B.  The  entire 
session  lasted  approximately  one  and  a half  clock  hours  in- 
cluding checkout  of  plots  and  consumed  less  than  one  minute 
of  CPU  time. 
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V.  SUMMARY  AND  RECOMMENDATIONS 


There  is  a growing  need  for  an  efficient  unified  collec- 
tion of  computer  programs  for  aiding  operations  researchers, 
statisticians,  economists,  managers,  and  other  people  who 
must  analyze  time  series.  The  Box-Jenkins  methodology  has 
opened  the  doors  of  time  series  analysis  to  an  expanding  popu- 
lation of  analysts.  Many  computational  algorithms  have  been 
developed  and  are  available  in  many  forms.  The  problem  has 
been  the  iterative  nature  of  the  Box-Jenkins'  model  building 
procedure.  Such  a procedure  is  straightforward  but  can  be 
very  time  consuming.  The  Time  Series  Editor  that  has  been 
described  in  this  report  provides  a unified  collection  of 
computer  programs  in  an  interactive  time-sharing  mode  that 
can  aid  the  user  in  all  phases  of  the  model  building  procedure 
from  the  plotting  of  the  time  series  to  the  forecasting  of 
future  values.  The  Editor  does  not  develop  new  computational 
algorithms.  Rather,  it  makes  those  that  are  available  easier 
and  faster  to  use.  With  its  simple  input  requirements  which 
are  all  prompted  by  written  instructions,  the  Editor  can 
easily  be  used  by  the  most  naive  computer  use. 

The  Box-Jenkins  methodology  has  been  described  in  Chapter 
2 and  descriptions  of  the  modules  of  the  Time  Series  Editor 
have  been  given  in  Chapter  3.  These  descriptions  were  given 
not  as  a substitute  for  study  of  the  Box-Jenkins  technique, 
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but  as  a communication  device  to  explain  to  the  potential 


user  of  the  Time  Series  Editor  what  happens  in  the  various 
stages  of  the  computational  process,  and  to  explain  what  the 
Editor  requests  mean.  An  example  time  series  analysis  cover- 
ing all  stages  of  the  Box-Jenkins  model  building  procedure 
was  covered  in  Chapter  5.  Appendices  contain  a User's  Guide 
to  the  Time  Series  Editor,  sample  user  sessions  and  program 
output,  and  program  listings. 

Although  the  Editor  covers  the  entire  model  building  pro- 
cess, as  described  by  Box  and  Jenkins,  from  the  plotting  of 
the  series  to  diagnostic  testing  and  forecasting,  much  more 
could  be  added  to  improve  the  Editor's  capability  and  utility. 
Listed  below  are  several  options  that  are  recommended  for 
addition  to  the  Time  Series  Editor. 

1.  Extend  the  diagnostic  ability  to  include  a periodo- 
gram  analysis  or  other  tests  related  to  the  spectral 
analysis  of  time  series. 

2.  Include  an  option  that  will  determine  all  roots  of 
the  characteristic  equation  and  give  the  general 
solution  to  the  autocorrelation  function  and  to  the 
eventual  forecast  function. 

3.  Modify  the  FORECAST  option  to  allow  forecasts  for 
seasonal  nonstation ary  series'. 

4.  Modify  the  ESTIMATE  option  to  provide  parameter  esti- 
mates for  seasonal  stationary  series'. 

5.  Expand  the  univariate  model  building  capability  to 
linear  transfer  function  model  building. 

6.  Expand  the  Time  Series  Editor  to  include  multivariate 
models  such  as  multiple  regression. 


L 


45 


APPENDIX  A 


USER'S  GUIDE  TO  TIME  SERIES  EDITOR 


46 


APPENDIX  A:  USER'S  GUIDE  TO  TIME  SERIES  EDITOR 


In  order  to  use  the  Time  Series  Editor,  the  user  must 
log  onto  CP,  link  to  the  disk  storage  area  where  the  Time 
Series  Editor  resides,  implement  CMS,  log  into  the  general 
user  and  Time  Series  Editor  disk  areas,  and  enter  the  EXEC 
routine.  It  is  also  necessary  to  log  out  of  the  system  at 
the  completion  of  execution.  The  material  which  follows 
will  enable  the  user  to  perform  the  above  steps  on  the  NPS 
CP/CMS  system.  Commands  marked  with  an  asterisk  (*)  are 
entered  by  the  user  (the  asterisk  itself  is  omitted).  Those 
without  an  asterisk  and  those  in  all  capital  letters  are 
system  responses.  Numbered  sentences  are  comments  which 
will  not  appear  during  an  actual  user  session.  The  instruc- 
tions and  system  responses  assume  an  IBM  2741  Input/Output 
Terminal.  Some  modifications  may  be  necessary  if  other 
terminals  are  used. 

1.  Turn  the  terminal  on,  depress  the  RETURN  key,  and  wait 
for  the  system  to  respond: 

CP-67  online  xd.65  qsyosu 

2.  Depress  the  ATTN  key.  The  roll  bar  will  advance  and 
the  keyboard  will  unlock.  Then  enter: 

♦login  xxxxgnn 

3.  xxxx  is  the  user's  identification  number,  and  nn  is 
the  terminal  number  (written  on  the  r igh t-hand- s i de 

of  the  terminal.  For  example,  if  the  user's  ID  is  0655 
and  the  terminal  number  is  07  the  input  would  be: 
login  0655g07 

4.  The  system  will  respond  with  the  statement: 

ENTER  PASSWORD: 
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5.  The  user  then  enters  his  password  (most  users  at  NPS 
have  the  password  npg) : 

*npg 

6.  The  system  will  then  respond: 

ENTER  4-DIGIT  PROJECT  NUMBER  FOLLOWED  BY  4-CHARACTER 
COST  CENTER  CODE: 

7.  The  user  then  enters: 

*aaaabbbb 

8.  aaaa  is  the  assigned  project  number  and  bbbb  is  the 
user's  section  designator  or  faculty  code. 

9.  The  system  will  respond  with  the  message  of  the  day 
such  as : 

CP/CMS  HRS..  0930-2200 (MON-THURS) .. 0930-1800 (FRI)  OUTPUT 
RETAINED  4 DAYS 
CMS  VERSION  3.2 

10.  At  this  point  the  user  is  in  CMS.  He  must  then  get  in- 
to CP.  This  is  done  by  hitting  the  ATTN  key.  The 
system  will  respond: 

C? 

11.  The  user  must  then  link  to  the  TIME  SERIES  EDITOR.  This 
is  accomplished  by  entering: 

* 1 ink  1 9 6 9p  191  192 

12.  The  system  will  respond  with  the  instruction: 

ENTER  PASSWORD: 

13.  The  user  then  enters: 

‘rfrr 

14.  The  system  will  respond: 

SET  TO  READ  ONLY 

15.  The  user  must  now  implement  CMS  by  entering: 

*ipl  cms 

16.  The  system  will  respond: 

CMS  VERSION  3.2 
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17.  Now  the  user  must  log  into  both  his  general  user  and 
the  Time  Series  Editor  area  by  entering: 

•login  191 

18.  The  system  will  respond  with  a message  about  the  status 
of  the  file  such  as: 

P ( 1 9 1 ) : 4 9 FILES;  225  REC  IN  USE,  71  LEFT  (OF  296), 

76%  FULL  (2  CYL) 

R; 

19.  The  user  should  then  enter  the  command: 

•login  192  t,p 
R; 

20.  The  system  will  respond: 

T ( 1 9 2 ) R/O 
R; 

21.  The  user  can  then  enter  the  Time  Series  Editor  by 
issuing  the  command: 

•timeser  exec 

22.  The  system  will  respond: 

YOU  HAVE  ENTERED  THE  TIME  SERIES  EDITOR 

PLEASE  RESPOND  TO  EACH  INQUIRY 

etc. 

23.  The  user  is  then  on  his  own,  guided  by  the  Exec  routine. 
See  the  notes  that  appear  at  the  end  of  this  Appendix 
for  additional  information.  Eventually,  the  user  will 
be  asked: 

DO  YOU  WANT  TO  TRY  AGAIN? 

24.  If  a yes  response  is  given  another  sequence  will  begin. 

A no  will  take  the  user  out  of  the  Time  Series  Editor. 
The  system  response  will  be: 

CONTROL  RETURNED  TO  CMS 
R; 

25.  The  user  then  can  log  out  by  entering: 

*cp  logout 

26.  The  system  will  respond: 
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CONNECT3  01.17.24  VIRTCPU=  001.03.37  TOTCPU=  001.30.14 
LOGOUT  AT  17.19.08  ON  3/21/77 

27.  The  user  should  then  turn  off  his  terminal  and  tear  off 
his  output. 

A sample  of  the  procedure  is  shown  below. 

cp-67  online  xd.65  qsyosu 

login  0655g07 
ENTER  PASSWORD: 

npg 

ENTER  4-DIGIT  PROJECT  NUMBER  FOLLOWED  BY  4-CHARACTER  COST 
CENTER  CODE: 

0986rk54 

CP/CMS  HRS, MONDAY  THRU  THURS DA Y , 0 9 3 0- 2 2 00 , FRI DA Y , 0 9 3 0 - 1 8 0 0 . 

OUTPUT  RETAINED  5 DAYS 

READY  AT  16.13.22  ON  03/22/77 

CMS  VERSION  3.2 

offline  read  * 

P (191):  49  FILES;  225  REC  IN  USE,  71  LEFT  (OF  296),  76%  FULL 

(2  CYL) 

OFFLINE  READ  FILE  FT02F001 
R; 

CP 

link  1969p  191  192 

ENTER  PASSWORD: 
r f rr 

SET  TO  READ  ONLY 


ipl  cms 

CMS  VERSION  3.2 
login  191 

P (191):  49  FILES;  225  REC  IN  USE,  71  LEFT  (OF  296),  76%  FULL 

(2  CYL) 

R; 

login  192  t , p 
T (192)  R/O 
R; 

timeser  exec 

YOU  HAVE  ENTERED  THE  TIME  SERIES  EDITOR 


DO  YOU  WANT  TO  TRY  AGAIN? 
n 

CONTROL  RETURNED  TO  CMS 
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R; 

cp  log 

CONNECT=  00.03.38  VIRTCPU=  000.00.34  T0TCPU=  000.01.30 
LOGOUT  AT  13.48.56  ON  3/11/77 


NOTES : 

a.  A typing  error  in  CP/CMS  can  be  corrected  by  typing  the  @ 
character  as  many  times  as  is  required  to  backup  and 
then  type  the  correct  values.  For  example,  the  command 
exec  timesre  could  be  corrected  by  typing  two  @ signs 
followed  by  the  correct  letters  er  as  follows: 

exec  timesre@@er 


b.  An  entire  line  can  be  deleted  by  typing  the  character  C . 

c.  The  user  should  exercise  care  before  depressing  the  re- 
turn key.  If  an  input  is  required  and  the  return  key 
is  depressed  before  any  character  is  entered,  the  user 
may  get  tossed  out  of  the  Editor.  Before  the  return  key 
is  depressed,  any  errors  can  be  corrected.  After  the 
return  is  hit  there  are  only  selected  cases  that  can 
later  be  corrected. 

d.  When  asked  to  enter  integer  data,  the  user  should  always 
right  justify  the  data  within  the  allowed  region. 

* * 

e.  If  at  any  time  the  user  encounters  a debug  condition 
(probably  caused  by  errors  in  entering  data)  and  he 
cannot  determine  what  needs  to  be  done,  he  can  get  out 
of  the  debug  condition  by  depressing  the  ATTN  key  twice. 
This  puts  the  user  back  into  CP  from  which  he  can  link 
back  to  the  Time  Series  Editor  or  logout. 
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APPENDIX  B 


SAMPLE  USER  SESSION  AND  PROGRAM  OUTPUT 


APPENDIX  B:  SAMPLE  USER  SESSION  AND  PROGRAM  OUTPUT 


cp-67  online  xd.65  qsyosu 

login  9655g07 
ENTER  PASSWORD: 

npg 

ENTER  4-DIGIT  PROJECT  NUMBER  FOLLOWED  BY  4-CHARACTER  COST  CENTER  CODE: 
0986rk54 

CP/CMS  HRS, MONDAY  THRU  THURSDAY, 0930-2200 , FRIDAY , 0930-1800 . 

OUTPUT  RETAINED  5 DAYS. 

FILES:-  01  RDR,  NO  PRT,  NO  PUN 
READY  AT  16.13.22  ON  03/22/77 
CMS  VERSION  3.2 

offline  read  * 

P (191):  49  FILES;  225  REC  IN  USE,  71  LEFT  (OF  296),  76%  FULL  (2  CYL) 

OFFLINE  READ  FILE  FT02F001 
R; 

CP 

link  1969p  191  192 
ENTER  PASSWORD: 
rfrr 

SET  TO  READ  ONLY 
ipl  cms 

CMS  VERSION  3.2 
login  191 

P (191):  49  FILES;  225  REC  IN  USE,  71  LEFT  (OF  296),  76%  FULL  (2  CYL) 
R; 

login  192  t,p 
T (192)  R/O 
R; 

timeser  exec 

YOU  HAVE  ENTERED  THE  TIME  SERIES  EDITOR. 

PLEASE  RESPOND  TO  EACH  QUERY  WITH  AN  INPUT  AT  THE  TERMINAL. 

ENTER  ONLY  THE  FIRST  LETTER  FOR  A WORD  RESPONSE. 

ENTER  NUMERICAL  VALUES  VIA  FORTRAN  FORMAT. 

TYPE  INTEGER  VALUES  FOR  NAMES  STARTING  WITH  I THRU  N. 

TYPE  INTEGER  VALUES  FOR  COUNTS.  TYPE  FLOATING 
VALUES  WITH  DECIMAL  FOR  ALL  OTHER  NAMES'. 

DO  YOU  WANT  A LIST  OF  OPTIONS? 

y 
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OPTION  DESCRIPTION 

GENERATE  GENERATE  ANY  ARIMA  TIME  SERIES 

AUTO  CALCULATE  AUTOCORRELATIONS,  PAUTOS , MEAN  AND  VARIANCE 

PLOT PLOT  A TIME  SERIES 

DIFF TRANSFORM  AND  DIFFERENCE  A TIME  SERIES 

ESTIMATE  CALCULATE  MAX  LIKELIHOOD  ESTIMATES  OF  ARMA  PARAMETERS 

SIMULATE  SIMULATE  TIME  SERIES  FROM  A GIVEN  MODEL 

FORECAST  FORECAST  FUTURE  VALUES,  CONSTRUCT  CONFIDENCE  INTERVALS 


WOULD  YOU  LIKE  MORE  INFO? 

y 

ENTER  OPTION  YOU  WANT  INFO  ABOUT, 
d 

DIFF  THIS  PROGRAM  TRANSFORMS  A GIVEN  TIME  SERIES  WITH  A LOG 

OR  AN  EXPONENTIAL  TRANSFORMATION  (OR  NO  TRANSFORMATION  IF  DESIRED) , AND 
THEN  TAKES  SEASONAL  AND/OR  SIMPLE  DIFFERENCES  OF  ANY  SPECIFIED  ORDERS. 
IT  THEN  OUTPUTS  THE  TRANSFORMED  AND  DIFFERENCED  TIME  SERIES.  THIS 
PROGRAM  IS  USED  TO  ATTEMPT  TO  MAKE  A SEASONAL  OR  A NONSTATIONARY  TIME 
SERIES  OF  THE  FORM  THAT  CAN  BE  HANDLED  VIA  BOX- JENKINS  TECHNIQUES.  IT 
USES  FTRDIF  FROM  THE  IMSL  LIBRARY. 

DO  YOU  WANT  TO  TRY  A SESSION? 

y 

ENTER  THE  LETTER  FOR  OPTION  YOU  WANT. 

P 

IS  YOUR  DATA  IN  FILE  FT02F001? 

y 

EXECUTION  BEGINS . . . 

ENTER  TITLE  FOR  PLOT, 
series  c,  original  values. 

TIME  SERIES  PLOTS  HAVE  BEEN  PRINTED  OFFLINE. 

DO  YOU  WANT  TO  TRY  AGAIN? 

y 

ENTER  LETTER  FOR  OPTION  YOU  WANT, 
a 

IS  YOUR  DATA  IN  FILE  FT02F001? 

y 

♦EXECUTION  BEGINS... 

AUTOCORRELATIONS 

0.978  0.944  0.902  0.854  0.802  0.748  0.692  0.635  0.570  0.523 

0.468  0.413  0.359  0.305  0.253  0.201  0.150  0.098  0.047  -0.003 

PARTIAL  AUTOCORRELATIONS 

0.978  -0.260  -0.157  -0.093  -0.058  -0.045  -0.012  -0.038  -0.022  -0.010 

-0.036  -0.041  -0.038  -0.024  -0.037  -0.027  -0.032  -0.070  -0.048  -0.024 
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IT 


MEAN=  22.9739  VARIANCE  = 4. 22273 

ENTER  TITLE  FOR  PLOTS, 
series  C,  original  values. 

YOUR  AUTO  AND  PAUTO  PLOTS  HAVE  BEEN  PRINTED  OFFLINE. 

PICK  UP  IN  ROOM  1140  UNDER  YOUR  USED  ID  NUMBER. 

DO  YOU  WANT  TO  TRY  AGAIN? 

y 

ENTER  LETTER  FOR  OPTION  YOU  WANT, 
d 

IS  YOUR  DATA  IN  FILE  FT02F001? 

y 

EXECUTION  BEGINS. . . 

IS  YOUR  TIME  SERIES  SEASONAL? 
n 

ENTER  NUMBER  OF  NONSEASONAL  DIFFERENCES. 

1 

DO  YOU  WANT  A LOG  TRANSFORMATION? 
n 

DO  YOU  WANT  AN  EXPONENTIAL  TRANSFORMATION? 
n 

YOUR  TRANSFORMED  TIME  SERIES  HAS  BEEN  STORED  IN  FILE  FT03F001. 

DO  YOU  WANT  A PRINTOUT  OF  THE  FIRST  TEN  VALUES? 

y 

THE  FIRST  10  VALUES  OF  THE  TRANSFORMED  SERIES  ARE: 

0.400009  0.099991  0.0  0.0  0.0 

-0.199997  -0.100006  -0.099991  -0.300003  -0.399994 

DO  YOU  WANT  TO  PLOT  AUTO  ANDPAUTO  OF  TRANSFORMED  DATA? 

y 

♦EXECUTION  BEGINS... 

AUTOCORRELATIONS 

0.805  0.653  0.526  0.442  0.380  0.318  0.262  0.186  0.139  0.144 

0.097  0.094  0.074  0.073  0.070  0.072  0.089  0.043  0.041  0.040 

PARTIAL  AUTOCORRELATIONS 

0.805  0.010  -0.007  0.051  0.027  -0.019  -0.013  -0.080  0.020  0.117 

-0.137  0.094  -0.027  0.034  0.006  0.012  0.043  -0.121  0.060  -0.005 

MEAN=-.346667E-01  VARIANCE  = 0.531982E-01 

ENTER  TITLE  FOR  PLOTS. 

series  C,  1st  order  differenced  data. 

YOUR  AUTO  AND  PAUTO  PLOTS  HAVE  BEEN  PRINTED  OFFLINE. 

PICK  UP  IN  ROOM  1140  UNDER  YOUR  USED  ID  NUMBER. 

DO  YOU  WANT  TO  TRY  AGAIN? 

y 

ENTER  LETTER  FOR  OPTION  YOU  WANT, 
d 
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IS  YOUR  DATA  IN  FILE  FT02F001? 

Y 

EXECUTION  BEGINS... 

IS  YOUR  TIME  SERIES  SEASONAL? 
n 

ENTER  NUMBER  OF  NONSEASONAL  DIFFERENCES. 
2 

DO  YOU  WANT  A LOG  TRANSFORMATION? 


n 

DO  YOU  WANT  AN  EXPONENTIAL  TRANSFORMATION? 


n 

YOUR  TRANSFORMED  TIME  SERIES  HAS  BEEN  STORED  IN  FILE  FT03F001. 

DO  YOU  WANT  A PRINTOUT  OF  THE  FIRST  TEN  VALUES? 

y 

THE  FIRST  10  VALUES  OF  THE  TRANSFORMED  SERIES  ARE: 

-0.300018  -0.099991  0.0  0.0  -0.199997 

0.099991  0.000015  -0.200012  -0.099991  0.199982 

DO  YOU  WANT  TO  PLOT  AUTO  AND  PAUTO  OF  TRANSFORMED  DATA? 

y 

EXECUTION  BEGINS. . . 

AUTOCORRELATIONS 


-0.079  -0.065 
-0.122  0.072 


-0.022  -0.063 
-0.077  0.029 


0.013  -0.018 
-0.011  -0.058 


0.049  -0.052 
0.171  -0.101 


-0.124  0.122 

-0.013  -0.020 


PARTIAL  AUTOCORELLATIONS 


-0.079  -0.072 
-0.143  0.023 


-0.135  -0.094 
-0.089  -0.005 


-0.022  -0.051 
-0.039  -0.074 


0.022  -0.060 
0.130  -0.105 


-0.145  0.094 

-0.014  -0.053 


MEAN=-. 267866E-02  VARIANCE  = 0.198143E-01 
ENTER  TITLE  FOR  PLOTS, 
series  C,  2nd  order  differenced  data. 

YOUR  AUTO  AND  PAUTO  PLOTS  HAVE  BEEN  PRINTED  OFFLINE. 
PICK  UP  IN  ROOM  1140  UNDER  YOUR  USER  ID  NUMBER. 


DO  YOU  WANT  TO  TRY  AGAIN? 

y 

ENTER  LETTER  FOR  OPTION  YOU  WANT, 
e 

*IS  YOUR  DATA  IN  FILE  FT02F001? 

y 

EXECUTION  BEGINS... 

ENTER  NUMBER  OF  AR  PARAMETERS. 

0 

ENTER  NUMBER  OF  MA  PARAMETERS. 

2 

ENTER  NUMBER  OF  DIFFERENCES. 

2 

* LENGTH  OF  TIME  SERIES  = 226 

0 AR  PARAMETERS  2 MA  PARAMETERS  2 DIFFERENCES. 
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ESTIMATED  AR  PARAMETERS  (UNDIFFERENCED  FORM)  ARE: 

PHI ( 1)  = 2.0000 

PHI (2)  = -1.0000 

ESTIMATED  MA  PARAMETERS  ARE: 

THETA ( 1 ) = 0.1382 

THETA (2)  = 0.1300 

MA  CONSTANT3  0.0  WHITE  NOISE  VARIANCE3  0.189515E-01 


DO  YOU  WANT  TO  PLOT  AUTO  AND  PAUTO  OF  RESIDUALS? 

y 

EXECUTION  BEGINS... 

AUTOCORRELATIONS 

0.020  0.034  -0.134  -0.093  -0.012  -0.043  0.004  -0.063  -0.125  0.088 
-0.142  0.069  -0.098  0.025  0.004  -0.055  0.157  -0.089  0.006  -0.006 

PARTIAL  AUTOCORRELATIONS 


0.020  0.034 

-0.166  0.017 


-0.135  -0.090 
-0.104  -0.013 


0.001  -0.055 
-0.025  -0.090 


-0.019  -0.071 
0.123  -0.123 


-0.143  0.087 

-0.021  -0.025 


MEAN3- .124975E-02 


VARIANCE  = 0.189499E-01 


ENTER  TITLE  FOR  PLOTS, 
series  C,  residuals  of  (0,2,2). 

YOUR  AUTO  AND  PAUTO  PLOTS  HAVE  BEEN  PRINTED  OFFLINE. 
PICK  UP  IN  ROOM  1140  UNDER  YOUR  USER  ID  NUMBER. 

DO  YOU  WANT  TO  CHECK  RESIDUALS? 


y 

EXECUTION  BEGINS... 

ENTER  IP,  NO  OF  AR  PARAMETERS  (DIFFERENCED  FORM) . 

0 

ENTER  IQ,  NO  OF  MA  PARAMETERS. 

2 

* CHI  SQUARE  LACK  OF  FIT  VALUE  = 28.74  DF  = 18  SIGNIFICANCE  = 0.0516 


DO  YOU  WANT  TO  TRY  AGAIN? 

y 

ENTER  LETTER  FOR  OPTION  YOU  WANT, 
e 

IS  YOUR  DATA  IN  FILE  FT02F001? 

y 

EXECUTION  BEGINS... 

ENTER  NUMBER  OF  AR  PARAMETERS. 

1 

ENTER  NUMBER  OF  MA  PARAMETERS. 

0 

ENTER  NUMBER  OF  DIFFERENCES. 

1 
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LENGTH  OF  TIME  SERIES  = 226 

1 AR  PARAMETERS  0 MA  PARAMETERS  1 DIFFERENCES. 

ESTIMATED  AR  PARAMETERS  (UNDIFFERENCED  FORM)  ARE: 

PHI  (1)  = 1.8131 

PHI (2)  = -0.8131 

MA  CONSTANT^  0.0  WHITE  NOISE  VARIANCE^  0.178396E-01 


DO  YOU  WANT  TO  PLOT  AUTO  AND  PAUTO  OF  RESIDUALS? 

y 

EXECUTION  BEGINS... 

AUTOCORRELATIONS 


0.010  0.009 

-0.095  0.083 


-0.053  -0.009 
-0.056  0.042 


0.058  0.021 

0.006  -0.041 


0.076  -0.020 
0.169  -0.080 


-0.089  0.130 

-0.001  -0.010 


PARTIAL  AUTOCORRELATIONS 


0.010  0.009 

-0.104  0.069 


-0.054  -0.008 
-0.048  0.041 


0.060  0.018 
0.003  -0.030 


0.074  -0.016 
0.157  -0.087 


-0.088  0.140 

0.013  -0.025 


MEAN=-. 897935E-02  VARIANCE  = 0.177590E-01 


ENTER  TITLE  FOR  PLOTS, 
series  C,  residuals  of  (1,1,0). 

YOUR  AUTO  AND  PAUTO  PLOTS  HAVE  BEEN  PRINTED  OFFLINE. 

PICK  UP  IN  ROOM  1140  UNDER  YOUR  USER  ID  NUMBER. 

DO  YOU  WANT  TO  CHECK  RESIDUALS? 

y 

EXECUTION  BEGINS... 

ENTER  IP,  NO  OF  AR  PARAMETERS  (DIFFERENCED  FORM). 

1 

ENTER  IQ,  NO  OF  MA  PARAMETERS. 

0 

* CHI  SQUARE  LACK  OF  FIT  VALUE  =21.53  DF=19  SIGNIFICANCE  = 0.3082 


DO  YOU  WANT  TO  TRY  AGAIN? 

y 

ENTER  LETTER  FOR  OPTION  YOU  WANT, 
f 

IS  YOUR  DATA  IN  FILE  FT02F001? 
y 

EXECUTION  BEGINS... 

ARE  YOUR  PARAMETERS  IN  DIFFERENCED  FORM? 

y 

ENTER  NO.  AR  PARAMETERS  (DIFF.  FORM). 

1 

ENTER  AR  PARAMETER  PHI(l). 

0.8131 

AR  PARAMETERS  ARE: 

0.8131 
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ARE  THESE  OK? 

y 

ENTER  NUMBER  OF  MA  PARAMETERS. 

0 

ENTER  OVERALL  MA  CONSTANT 
0 

ENTER  THE  NUMBER  OF  DIFFERENCES. 

1 

ENTER  INDEX  FOR  FORECAST  ORIGIN  VIA  13. 

020 

ENTER  FORECAST  LEAD  TIME  VIA  12. 

13 

ENTER  SIGNIFICANCE  LEVEL,  ALPHA. 

.2 

MA  CONSTANT  =0.0  NUMBER  OF  DIFFERENCES  = 1 

FORECAST  LEAD  TIME  = 13  ALPHA  = 0.200 
ARE  THESE  OK? 

y 

AR  PARAMETERS  IN  UNDIFFERENCED  FORM  ARE: 

1.8131 

-0.8131 


THE  FORECASTS  ARE: 


23.1561 

22.9577 

22.7964 

22.6653 

22.5586 

22.4719 

22.4014 

22.3440 

22.2974 

22.2594 

22.2286 

22.2035 

22.1831 

80.0%  CONFIDENCE  LIMITS  : 

FOR  FORECASTS  ARE: 

0.172546 

0.357271 

0.556689 

0.761553 

0.966487 

1.16832 

1.36521 

1.55613 

1.74057 

1.91838 

2.08959 

2.25439 

2.41304 

PS I WEIGHTS 

FOR  FORECAST 

ERROR  ARE: 

1.81310 

2.47423 

3.01180 

3.44889 

3.80429 

4.09326 

4.32822 

4.51927 

4.67461 

4.80092 

4.90362 

4.98712 

5.05501 

WHITE  NOISE 

VARIANCE  = 0 

. 18127E-01 

DO  YOU  WANT  TO  PLOT  FORECASTS? 

y 

*DO  YOU  REALLY  WANT  TO  PLOT  FORECASTS? 

y 

YOUR  TIME  SERIES  AND  FORECASTS  ARE  PLOTTED  OFFLINE. 
DO  YOU  WANT  TO  TRY  AGAIN? 

y 

ENTER  LETTER  FOR  OPTION  YOU  WANT, 
s 

IS  YOUR  DATA  IN  FILE  FT02F001? 

y 

‘EXECUTION  BEGINS. . . 

ENTER  NUMBER  OF  AR  PARAMETERS  (UNDIFFERENCED  FORM) . 
2 
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ENTER  ESTIMATED  AR  PARAMETER  PHI(l) . 

1.8131 

ENTER  ESTIMATED  AR  PARAMETER  PHI (2) . 

-0.8131 

AR  PARAMETERS  ARE  1.8131  -0.8131 

ARE  THESE  OK? 

y 

ENTER  NUMBER  OF  MA  PARAMETERS 
0 

ENTER  OVERALL  MA  CONSTANT 
0 

ENTER  ESTIMATED  WHITE  NOISE  VAR 
0.0178 

DO  YOU  WANT  TO  INPUT  STARTING  VALUES? 
n 

START (1)=  19.000000 

START (2) = 18.799988 

STARTING  VALUE (S)  OK? 

y 

ENTER  RANDOM  NUMBER  SEED  (BETWEEN  0 AND  1) . 
0.7659376 

ENTER  NUMBER  OF  VALUES  YOU  WANT  TO  SIMULATE  VIA  13. 
065 

DO  YOU  WANT  MORE  THAN  ONE  SIMULATED  SERIES? 

y 

HOW  MANY  SERIES  DO  YOU  WANT  (LE.9)? 

2 

DO  YOU  WANT  A PRINTOUT  OF  THE  FIRST  20  VALUES? 

y 

SIMULATED  VALUES  ARE: 


18.640320 

18.511963 

18.631088 

18.626831 

18.787552 

18.963882 

19.074707 

19.205811 

19.261063 

19.633331 

19.821411 

20.042175 

20.166321 

20.347610 

20.369507 

20.216507 

20.133179 

20.174332 

20.287857 

20.241806 

DO  YOU  WANT  TO  PLOT  SIMULATED  VALUES? 

y 

EXECUTION  BEGINS . . . 

ENTER  TITLE  FOR  PLOT. 

simulations  with  model  (1,1,0) , series  C. 
TIME  SERIES  PLOTS  HAVE  BEEN  PRINTED  OFFLINE. 

DO  YOU  WANT  TO  TRY  AGAIN? 
n 

CONTROL  RETURNED  TO  CMS 
R; 
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